Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
📊 LLM Evals
Specific
model evaluation, benchmarks, evals
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
7105
posts in
10.8
ms
LLM
Evaluation
Metrics
: The Ultimate LLM
Evaluation
Guide
confident-ai.com
·
1d
·
Discuss:
Hacker News
🤨
AI Criticism
TLAi
+
Benchmarks
for Evaluating LLMs
github.com
·
3d
·
Discuss:
Hacker News
🦙
Local LLM
How we compare model quality in
CursorWe
use a hybrid online-offline
eval
process to keep our understanding of model quality aligned with what developers actual...
cursor.com
·
1d
·
Discuss:
Hacker News
⚙️
Performance Profiling
Java 18 to 25
Benchmarks
: How Performance
Evolved
Over Time
repoflow.io
·
2d
·
Discuss:
r/java
,
r/programming
⚙️
Performance Profiling
Leaderboard of
Leaderboards
– A Real-Time Meta-Ranking of AI
Benchmarks
huggingface.co
·
3h
·
Discuss:
Hacker News
🧠
AI
Forecast
evaluation
and
dataset
analysis before modeling
quantsynth.org
·
1d
·
Discuss:
Hacker News
⌚
Quantified Self
Show HN:
Stratum
– SQL that branches and beats DuckDB on 35/46
1T
benchmarks
datahike.io
·
8h
·
Discuss:
Hacker News
🗄️
Database Internals
Less-relevant results
Building a
Zero-Click
AI
Evaluation
Pipeline for Production
hackernoon.com
·
3d
💬
Prompt Engineering
Built a
skill
that
benchmarks
any
skill
spec.workers.io
·
1d
·
Discuss:
Hacker News
🔧
Code Generation
How are people do AI
evals
these days?
news.ycombinator.com
·
2d
·
Discuss:
Hacker News
🤨
AI Criticism
Test
Evals
Are Not
Enough
voratiq.com
·
3d
·
Discuss:
Hacker News
🛠
Developer Experience
Learn
Haskell
in Two
Weeks
vitez.me
·
1h
·
Discuss:
Lobsters
λ
Functional Programming
Alpine
glacier
holds history dating back to the
Romans
. And it’s melting—fast.
popsci.com
·
53m
🔗
Obsidian
MAHA
Institute:
Nix
The Entire Childhood Vaccine Schedule
techdirt.com
·
2h
🐛
Fuzzing
Under the
hood
: The AI powering Firefox’s Shake to
Summarize
blog.mozilla.org
·
11h
💬
Prompt Engineering
Popping
Bottles
ma.tt
·
1h
🍎
Apple
Spring
Meetups
Everywhere
2026
astralcodexten.com
·
1h
🏙️
Boston Tech
Nintendo’s Family
BASIC
Keyboard Gets
USB
Upgrade
hackaday.com
·
52m
⌨️
Mechanical Keyboards
PromptExecution/reqif-opa-mcp
:
ReqIf
GitOps Open Policy Agent MCP interface for LLM evaluation
github.com
·
1d
·
Discuss:
Hacker News
📦
PWA Tooling
Studying with
Ludwig
Lachmann
marginalrevolution.com
·
41m
🧪
Social Science
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help